Distributed flow optimization and cascading effects in weighted complex networks 
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We investigate the effect of a specific edge weighting scheme ^ (kikj)^ on distributed flow effi- 
ciency and robustness to cascading failures in scale- free networks. In particular, we analyze a simple, 
yet fundamental distributed flow model: current flow in random resistor networks. By the tuning 
of control parameter /3 and by considering two general cases of relative node processing capabili- 
ties as well as the effect of bandwidth, we show the dependence of transport efficiency upon the 
correlations between the topology and weights. By studying the severity of cascades for different 
control parameter /3, we find that network resilience to cascading overloads and network throughput 
is optimal for the same value of /3 over the range of node capacities and available bandwidth. 



PACS numbers: 89.75.Hc, 05.60.Cd, 89.20.Ff 

I. INTRODUCTION 

Distributed flows are ubiquitous in natural and man 
made systems. Examples of such distributed flows in- 
clude but are not limited to: water flow in riverbeds, 
data flow on the Internet, current flow in the power grid, 
nutrient flows in leaves. By studying real- world trans- 
port systems, a natural question appears as how such 
flows can be designed in order to optimize various trans- 
port characteristics, collectively referred to as objective 
functions, e.g. searchability [1], transport efficiency [2|,|3|, 
average packets traveling time [4] , resilience against cas- 
cading overloads [5j and against damages [6]. 

In a large class of transport models in networks, flow is 
assumed to be limited to and directed along the shortest 
paths between the source and destination [3, 0, El • In an- 
other class of flow models, motivated primarily by search 
and discovery in networks, routing of packets can be ran- 
dom, which have led to the studies of regular [9l, |lo|, 
weighted Illl4l4j, or adaptive random walks (RWs) in 
networks [IS^ . In between these two extremes lies a class 
of models where transport and flow is both directed and 
distributed: packets or information flow are distributed 
among all possible paths emanating from the source and 
ending at the target node. In this paper we systemati- 
cally study the simplest such example of a directed, dis- 
tributed flow: currents in resistor networks. 

Resistor networks are arbitrary networks in which all 
edges are resistors with a speciflc "electrical" conduc- 
tance and a pair of nodes have been designated to be the 
source and sink (target) of a current / flowing through 
the network. They have not only been used to capture in- 
herent transport capabilities of the underlying (possibly 
weighted) c omp lex communication or information net- 
works [ill: [l6l - [l8| , but also to describe the transport of 
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carbohydrates in plants Jl 91] , to investigate flow of fluids 
in porous media [20, 2l|, to flnd communities in com- 
plex networks Q , or to construct page-ranking schemes 
for search engines [23]. Further, there are fundamental 
connections between random walks (hitting times) and 
resistor networks (two-point resistance) [9|, [nl, [2J-[2S|- 
These fundamental observables (and the corresponding 
load and betweenness measures) can also serve as a start- 
ing point in routing schemes in actual communication 
networks [2^,^]. 

Here, we present extensive and systematic numerical 
results pertaining to two general cases of relative node 
processing capabilities along with the effect of bandwidth 
on current flows in weighted resistor networks. In the 
following section (Sec. |TT1) we describe the model we use 
to study distributed flows. In Sec. [1111 we study numeri- 
cally the effect of two different node capacities and band- 
width upon flow efficiency. In Sec. IIVI we present results 
about maximizing network robustness against cascading 
failures, and present concluding remarks in Sec. |Vl 



II. MODEL 

In this paper we consider a specific scheme of assign- 
ing conductances to edges — edge weighting scheme— and 
study how the parameter in this scheme affects the effi- 
ciency of fiow and robustness of the network to cascading 
failures. Consider a resistor network having N nodes and 
M edges. The conductance of an arbitrary edge e = (i, j) 
is set to be proportional to the end-point degrees, namely 
Cij ~ aij{kikjY ^ where a^j is the {i^jY^ element of the 
network's adjacency matrix and /3 is a control parameter, 
taking only real values. This choice, also studied before 
in [Tr, '31', I32I, has been motivated by empirical stud- 
ies [33, 34] where edge weights were observed to follow 
a similar trend. An additional motivation is that this 
scheme provides a convenient way of studying topologi- 
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cally (structurally) biased flows as a function of {3 con- 
trol parameter. This parameter allows one to bias the 
current flow predominantly towards large degree nodes 
(hubs) (for positive j3) or to avoid them (for negative j3). 
When /3 = 0, all edge weights are equal, thus the current 
flow is solely influenced by the network topology. 

Currents along the edges of the network are obtained 
following the method summarized in [35]. When / units 
of current flow into the network at a source s and leave at 
a target t, then for an arbitrary node z, charge conserva- 
tion (Kirchoff's law) combined with Ohm's law dictates: 

N 

^a,(F, -F,)=A^^. -^^^), Vi = l,...,iV. (1) 

Keeping in mind that the weighted network Laplacian 
can be written as Lij = dijCi — Cij^ where Ci = 
the system of linear equations ([1]) can be 
transformed into the matrix equation 

LV = X. (2) 

V is the unknown column voltage vector, while Xi is the 
net current flowing into the network at node i, which 
is zero in all cases except the source and target nodes. 
Equation [2] is solvable for voltages, as long as the inverse 
of the Laplacian L is known. As the L matrix is singu- 
lar (all rows and columns of L sum up to zero implying 
that there is an eigenvalue Ai = with a correspond- 
ing constant eigenvector) it can not be directly inverted. 
This, however presents no technical difficulty, as all rel- 
evant physical observables can be expressed in terms of 
the inverse (or pseudo-inverse) Laplacian, deflned in the 
space orthogonal to the zero mode: G = L~^. This is 
achieved by using spectral decomposition of the network 
Laplacian [11, 36j. For example, by choosing the refer- 
ence potential to be the mean voltage [18], Vi = Vi — (F), 
where (V) = (l/N) Vj, one obtains: 

N 

V, = (GX), = G^JI{SJs - Sjt) = I{G,s - Gu), (3) 

for each node i. Thus, for / units of current and for a 
given source/target pair, the current flowing through an 
arbitrary edge e = (i, j) is 

= Cij{V, -Vj) = CijIiGis - Git - Gjs + G,t), (4) 
while the current flowing through node i is given by 

j 

where the sum is taken over all neighbors of node i. So far 
we have considered the general case of / units of current 
entering (leaving) the network at a given source (target) 
pair. For the studies in this paper, with the exception 
of Sec. IIVB[ we assume that unit current flows between 



N source/target pairs simultaneously. Speciflcally, we as- 
sume that all nodes are simultaneously sources and unit 
current flows into the network at each source. For each 
source node a target is chosen randomly and uniformly 
from the remaining N — 1 nodes. Consequently, the net 
current flowing through an arbitrary edge /node gives the 
edge/node current-flow betweenness [351. IStI. Issj : 

N N 
s,t=l s,t=l 

Note, £i = where the sum is over all neigh- 

bors of node i. These quantities capture appropriately 
the amount of information passing through an edge or 
a link in a distributed fashion, and therefore we will re- 
fer to them as edge and vertex loads. We clarify, this 
is different from the quantity 'vertex load' introduced in 
jsol , [4Qt] , defined as the accumulated number of data pack- 
ets passing through a vertex when all pairs of nodes send 
and receive a data packet in unit time transmitted along 
the shortest paths connecting them. Currents along the 
edges and nodes are uniquely determined by the network 
topology and the weight exponent p. Therefore, this is a 
fully deterministic model and the only source of random- 
ness in the problem is in the network structure. 

We restrict our studies to random, uncorrelated scale- 
free networks constructed using the configuration model 
(Zlj characterized by a fat-tailed degree distribution 
P{k) = ck~^ , where c is the normalization constant and 
kma.-mN)^/^ [42|. 

III. VERTEX AND EDGE LOAD LANDSCAPES 

Attributes of the vertex load landscapes for random 
scale-free networks are shown in Fig. [H Positive load- 
degree correlations are observed in Fig. [TJa) for unbiased 
(/3 = 0) fiow and these correlations become stronger as 
the fiow gets increasingly biased towards the hubs (/3 > 0) 
as shown in Fig. [T]^b). The correlations disappear when 
/3 ^ —1; the loads become balanced, however the aver- 
age vertex load in the network increases slightly. As the 
hub-avoiding bias of fiows increases (by lowering /3) rela- 
tively high loads start to appear on small degree nodes, 
contributing to an increase in the average vertex load. 
The same tendency was previously observed [35|, (43j for 
Barabasi- Albert (BA) scale- free graphs [44]. Further- 
more, vertex-load distributions [Fig. [D^c)] show a power- 
law tail for unbiased and hub-biased fiows with gradually 
decreasing exponential cut-offs. The tail significantly di- 
minishes as the fiow is biased against hubs and towards 
small degree nodes e.g., for /3 ^ —0.5 and disappears for 
/3 ~ — 1. Note, that for /3 ~ — 1 the variance of the dis- 
tribution is the smallest among all /3 values. A strong 
correlation between the degree exponent of the network 
and the exponent 5 characterizing the power-law tail of 
the vertex-load distributions for unbiased fiow (/3 = 0) 
can also be observed as shown in Fig. md). For 7 = 2.5, 
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FIG. 1. (a) Scatterplot between vertex load and degree show- 
ing strong correlation between these variables for unbiased 
flow (/3 = 0). Color coded horizontal lines show the average 
vertex load in the network: green solid line for f5 — 0.0, red 
dashed line for f5 — —1.0 and blue dashed line with two dots 
for f5 — —2.0. (b) Average vertex load for different degree 
classes as we vary {5. Data in both (a) and (b) are plotted for 
random scale- free networks with 7 = 2.5. (c) Vertex- load dis- 
tributions calculated on scale-free networks with 7 = 3.0 for 
various values of /3. (d) Vertex- load distributions for unbiased 
flow for different values of degree exponents 7. Data were 
averaged over 400 different network realizations (iV = 10^, 
{k) = 10). 



3.0 and 3.5 we obtain 5 = 2.52 ± 0.01, 3.01 ± 0.02 and 
3.59 ± 0.01, respectively. 

The edge-load distributions for various values of /3 val- 
ues are shown in Fig. [2](a). Large heterogeneity in the 




FIG. 2. (a) Edge- load distributions in random scale- free net- 
works (7 = 3.0) obtained for various values of /3. (b) Edge- 
load distributions for unbiased flow (/3 = 0.0). Network spec- 
iflcations are the same as in Fig. [1] 

edge-load distribution is observed for strong hub-biased 
flows {/3 = 2.0). This heterogeneity diminishes as the hub 
bias is decreased and the tail of distribution becomes ex- 
ponential. In contrast with node loads in Fig. [IJd), the 
edge loads, for unbiased flows are normally distributed as 
shown in Fig. Efb). Interestingly, the edge loads become 
most balanced for P ^ 0.1, when flows are slightly biased 
towards large degree nodes. 



IV. DISTRIBUTED FLOW OPTIMIZATION 

A. Optimal network throughput 

Having considered the load landscapes for different /3 
ranges, we are interested in how to characterize the flow 
efficiency in the network. Here, we characterize traffic 
flow efficiency by network throughput [7, 35]: the maxi- 
mum input current that the network is able to trans- 
port without becoming congested. To begin with, we 
consider two cases. In the flrst case (i) which we call 
node-limited^ nodes have identical processing capability, 
set to unity, and edges have transport capacity (which 
is also interchangeably referred to as bandwidth) b that 
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is unbounded {b = oo). In the second case (ii) which 
we cah edge-limited^ the processing capabihty of every 
node is unbounded and all edges in the network have a 
finite and identical bandwidth (6 = 1). In both cases, 
we now assume that ^ units of current flow between N 
source-target pairs simultaneously. As we increase the 
input current <l>, the node with the highest load will be 
the first to become congested. Thus, for a given net- 
work, we are interested in the problem of choosing the 
link conductances such that the network is least suscep- 
tible to congestion. To fix notations, we denote by $c"'^ 
the node-limited throughput^ which is the maximum in- 
put current for which the network is congestion free in the 
node-limited case, and by <l>c^^ the edge-limited through- 
put, that is the maximum input current for which the 
network is congestion free in the edge-limited case. As 
shown earlier [3, [35| , these quantities are only limited by 
the maximum vertex load imax and maximum edge load 
^{e)max in the network, respectively. Mathematically, 
they are defined as 



1 



$(e) 



(7) 



and their dependence for various link conductances are 
shown in Figs. [3fa) and ^h). For random scale- free 
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FIG. 3. (a) Node-limited throughput and (b) edge-limited 
throughput is shown as function of /3 control parameter ob- 
tained for scale- free and ER networks (N — 10"^, (/c) = 10) 
averaged over 400 network realizations. 

networks, for 7 in the range of [2.5, 3.5], as well as 
for Erdos-Renyi (ER) random graphs [45|, the node- 
limited throughput attains a maximum value for the hub- 
avoiding flow around /3 ^ — 1 (observed also previously 



in scale- free BA network [35|, |43|), contrary to the edge- 
limited throughput attaining an optimal value for the 
weakly hub-biased flow around /3 ^ 0.1 and a slightly 
higher value around /3 ~ 0.2 for ER networks. As it was 
shown in Figs. [TJc) and [2fa) the /3 ^ — 1 and /3 ^ 0.1 
values are not coincidental. These are the values for 
which the vertex and edge loads are balanced, respec- 
tively. We also observe that for all values of /3 and for 
various network topologies, ^'f^ > ^c^\ indicating that 
node-capacity constraints have a more severe effect on 
network throughput than bandwidth constraints. The 
quantities <I>c^^ and <I>c^^ increase as the fatness of the de- 
gree distribution tails reduces. Furthermore, the homoge- 
neous ER graph exhibits a larger node- and edge-limited 
throughput than the heterogeneous scale-free topologies 
with similar characteristics. Finally, the increase of the 
network average degree results in the decrease of the max- 
imum vertex and edge loads, which in turn contributes 
to the increase of $c^^ and ^i^'^ (not shown). 

An interesting application of the above weighted and 
distributed traffic scheme is its implementation to an ac- 
tual empirical network structure. To that end, we an- 
alyzed a July 2006 snapshot of the Internet at the au- 
tonomous system (AS) level [46j. The network, com- 
posed of 22963 nodes has an average degree (k) = 4.22; 
it is characterized by a fat-tailed degree distribution and 
shows disassortative mixing by degree [47] (see inset of 
Fig. m^a)). The vertex load corresponds to the net num- 
ber of data packets passing through an AS. Qualitative 
similarities are observed between Fig. Hl^a) and Fig. [Hb); 
furthermore between Figs. ID^b), ID^c) and Figs. [H^a) and 
[3l^b), respectively. Following the presented results from 
our model so far, it is obvious that the vertex loads would 
be optimally balanced in the hub- avoiding regime. For 
the Internet, the maximum of the node-limited through- 
put occurs around P ~ —1.75 [Fig. IH^b)], a significantly 
lower value than the one obtained for uncorrelated scale- 
free networks. Further, the edge-limited throughput is 
optimal at around /3 ^ [Fig. ID^c)], essentially un- 
changed from uncorrelated scale- free graphs. This be- 
havior of general disassortative networks can be qualita- 
tively understood as follows. Assuming that the scaling 
of the vertex load is dominated by the weighted degree 
Ci = dijCij for each node, one can show that the 
value of p which balances the vertex loads will be lower 
for disassortative networks (such as the Internet) than 
for uncorrelated ones. Therefore, a stronger hub avoid- 
ance is required to balance the loads on a disassortative 
network as compared to an uncorrelated one. On the 
other hand, assuming that the edge load is dominated by 
the weighted couplings Cij [Eq. (|4])], the degree-degree 
correlations have no major impact scaling of the edge 
load. Hence, the value of the weighting parameter which 
balances the edge load remains about /3 ~ 0, unchanged 
from uncorrelated networks. Finally, in both cases one 
can assume that maximal throughput is associated with 
balanced loads. 

In addition to the node-limited and edge-limited cases. 
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FIG. 4. (a) Average vertex load for different degree classes 
obtained for the Internet at AS level ^] for hub-avoiding 
{/3 = -2.0,-1.5) and unbiased {(3 = 0.0) flows. The inset 
shows the average nearest neighbor degree knn as function 
of node degree, (b) Node-limited throughput and (c) edge- 
limited throughput is shown as function of /S control param- 
eter. 



fer current flow, therefore <l>c^^^ = 0. As seen in Figs. 
[31(a) and [31(b) the node capacity constraint < 1 
(for all nodes i) is a more restrictive constraint than 
< 1 (for all edges (i,j)) constraint. Thereby, in 
this model the unit bandwidth 6 = 1 is an upper limit 
of increasing network throughput expressed mathemati- 
cally as ^^^\b = 1) = By lowering the bandwidth 
of edges, we restrict the current flowing through the net- 
work, consequently decreasing the 
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in Fig. [Sl^a). As b decreases we observe the continuum 
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FIG. 5. (a) Node-edge- limited throughput ^c^^^* as function 
of control parameter (3 and finite bandwidth b of edges, (b) 
Cross-sections of the ^c^^'* surface shown in (a) highlighting 
the transition of the optimum value from hub-avoiding flows 
~ —1) to weakly hub-biased flows (/3 ~ 0.1) as bandwidth 
is lowered from b = 0.78 to 6 = 0.02. Data were obtained 
for random scale-free network of degree exponent 7 = 2.5. 
Network speciflcations are the same as in Fig. [1] 



we consider a third case (in) that we call node- edge- 
limited, when all nodes have unit processing capabilities 
and all edges have a flnite, identical bandwidth, b > 0. 
The network is congestion free for any ^ units of cur- 
rent flowing through the network as long as < 1 
and ^£ij < b conditions are satisfled for each node i and 
edge (i, j), respectively. The node- edge-limited through- 
put ^^^^ , that is the maximum input current for which 
the network is congestion free in the node-edge-limited 
case is deflned as: 



^{i^J^J/b} ' 



(8) 



in). 



In the case of inflnite bandwidth, {b = 00) = ^ 
for no bandwidth {b = 0) the network is unable to trans- 



of cases between the node-limited and edge-limited ex- 
tremes. This decrease is accompanied by the shift of the 
optimal value of ^^^^ from the region of hub-avoiding 
flows {/3 < 0) to the region of slightly hub-biased flows 
{/3 > 0) also shown in Fig. [5fb). The overall shape of 

^^J'''\/3,b) surface remains unaltered as we increase 7 de- 
gree exponent (not shown) in agreement with Figs. [3l^a) 
and[3jb). 



B. Heterogeneous allocation of network resources 

In the previously considered cases we implicitly as- 
sumed that the probability of being a source is the same 
across all nodes (1), and so is the probability of being 
a target [1 - (1 - 1/{N - 1))^"^]; additionally, the cur- 
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rent flowing between each source-target pair is identical. 
In the fohowing, we address a particular case of hetero- 
geneous flow where the incoming and outgoing flow rate 
between each source-target pair is proportional to the de- 
gree of source ks and target kt nodes, namely ~ (ksktY ^ 
for p > real. Considered previously in [35|, this model 
of heterogeneous flow is inspired by a study of the world- 
wide air-transportation network [33|, wherein the traflic 
(total number of passengers) handled by each airport was 
observed to scale as ~ k^ ^ where k was the total degree 
of the airport. The appropriately weighted vertex load 
li{p) in this case is deflned as [35| 



N 



{kshyif, Vi = l,...,7V. (9) 



Note, ^i{p = 0) = -^i. Further, we denote by ^^^\p), 
the node-limited weighted throughput and by ^^c\p) the 
edge-limited weighted throughput that are generalizations 
of the node-limited and edge-limited throughputs deflned 
in the previous subsection. Therefore, similar to ^^(^^ and 
^^c^ deflned by (O, their weighted counterparts are only 
limited by the maximum weighted vertex and edge loads, 
respectively. 

In the following, we restrict our studies to the case 
when p = 1.0. Qualitative similarities are observed be- 
tween the insets of Figs. [6l^a), (b) and Figs. [3l^a) and 
[3fb), respectively. Additionally, we observe a shift to- 
wards lower values of {3 in the position of both node- 
limited and edge-limited optimal throughputs. Specifl- 
cally, the <l>c"^^(1.0) attains a maximum value for the hub- 
avoiding flow around /3 ~ —1.5, while <l>c^^(1.0) reaches 
its maximum value for the unbiased flow, when /3 ~ 0. 
The strong hub avoidance arises to compensate for the 
hub bias implicitly introduced by the heterogeneous flow 
(~ k^). The network throughput for this case is signifl- 
cantly lower than for the case of p = 0. 

Further on, we restrict ourselves to the case when there 
is inflnite bandwidth associated to the edges and the 
only flow limiting factor is the processing capacity of 
the nodes, denoted by Qi for all node i in the network. 
Rather than consider an optimization of the functional 
form of the Q distribution, we consider the case when Qi 
follows a parametrized form Qi{X) ~ kf. As previously, 
we consider the constraint that the sum of the node pro- 
cessing capacities is equal to N. We then focus on the 
question of how the node processing capacities can be 
distributed such that the congestion-free throughput of 
the network is maximized. The network is congestion 
free as long as condition 
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FIG. 6. (a) Maximum weighted vertex load and (b) maximum 
weighted edge load as function of /3 control parameter for 
p — 1.0. Data were obtained for scale- free networks (N — 10^, 
(/c) = 10) with various 7 degree exponents, averaged over 400 
network realizations. Insets show the node-limited weighted 
throughput and edge- limited weighted throughput 

$c'^'*(p), respectively, when p — 1.0. 



optimal is shown in Fig. [6l^b). The global maximum of 
the throughput is attained for hub-biased flow (/3 ~ 0.5) 
when A ~ 1.25. As A is increased, not surprisingly, opti- 
mal throughputs occur at increasing values of /3, i.e. for 
increasingly hub-biased flows. However optimal through- 
put values themselves behave non-monotonically, increas- 
ing as A is increased from zero and then decreasing after 
a maximum is attained around A ~ 1.25. The scale- free 
world wide airline transportation network [33] has been 
reported to operate in the hub-biased regime. However, 
to make any conclusive statements whether this network 
operates close to its optimal regime would be debatable 
for lack of empirical data on how the flow of passengers 
between source-target airports as well as airport capaci- 
ties scale with local connectivity. 



Comparing distributed flow to shortest path 
flow 



k^N 



yi 



(10) 



is satisfled. Fig. [Tfa) depicts the ^^\p = 1.0) A) sur- 
face. A clearer picture of the relevant range of A values 
for which this particular network throughput becomes 



Our study has focused on distributed flows that uti- 
lize all possible paths between a source and target node 
pair in proportion to the conductances of these paths. 
As pointed out in the introduction, another commonly 
studied model of flow is one that utilizes solely the short- 
est path(s) between a source/target node pair. In the 
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the relevant range of A values for which the throughput is op- 
timal. Data were obtained for scale-free networks of 7 = 2.5. 
Network specifications are the same as in Fig. (TJ 



latter case the relevant measure of load for the opti- 
mization problem we discuss here is the shortest-path 
betweenness centrality [isl, |49|. We now compare the 
two different flow strategies to examine which one would 
yield a higher network throughput. The node-limited 
throughput $c^\ defined in Sec. IIV Al is plotted for ran- 
dom scale- free and ER networks in Figs. [8fa). Here, the 
weight associated to edge (i, j) is l/Cij and the shortest- 
path betweenness centrality is defined with a similar nor- 
malization as in the case of current-flow betweenness: 
Csii) = - ^)Y.sy^ty^i^st{i)/crst for any node i. 

(Tst{i) denotes the number of those shortest paths be- 
tween source s and target t node that pass through node 
i, while a St gives the total number of shortest paths be- 
tween s and t. The maximum of <l>c^^ for both types of 
flows is attained in the hub-avoiding regime, although the 
current flow requires a stronger hub avoidance {j3 ~ —1) 
than shortest-path flow {j3 ~ —0.5). The optimal values 
for both cases are larger in case of ER graphs: the homo- 
geneous structure of the network contributes positively 
to its transport ability. As seen from Fig. [D^a), for the 
particular system size N = 10^, the maximum value of 
node-limited throughput is larger in the case of shortest 
paths than for currents {^^\sh.p.) > ^^\current)). 
To see how this comparative feature is affected by net- 
work size, we investigated the values of imax on networks 



0.4 
0.3 
0.2 
0.1 


10^ 

■10^ 
10^ 



(a) 



■ 

I ^ 00 SF, Current flow 
I \ ^# SF, Shortest path flow 
ER, Current flow 
ER, Shortest path flow 



/ n/-4-i\ \ 



-3 



-1 



(b) 



Current flow 
Shortest path flow 



10 



10 



N 



10 



FIG. 8. (a) Node-limited network throughput obtained for 
two different flow strategies: current and shortest-path flow, 
for scale-free (7 = 2.5) and ER networks (N = 10^, (/c) = 
10). (b) System size dependence of the largest vertex load on 
unbiased (/3 = 0.0) current (black circles) and shortest path 
flow (blue squares) in scale- free networks ((/c) = 5.0). 



of different sizes, for both kinds of flow. The results 
shown in Fig. [8fb) indicate that shortest path between- 
ness scales faster with N than current flow betweenness. 
Extrapolation of the linear fits indicate that this trend 
{^^c^^sh.p) > ^i'^\current)) holds up to 15000 nodes, 

after which ^^\sh.p.) < ^^\current). The maximum 
vertex load scales with system size as ^ jy^-^^ while the 
highest shortest path betweenness centrality scales with 
system size as ~ ]\f0.80^ value also reported in [7,]. 



V. ROBUSTNESS AGAINST CASCADING 
FAILURES 

Cascading failures embody one of the common vulner- 
abilities of infrastructure networks [50|, Uli- Models of 
cascading failures have been previously studied in (52I - 
55]. Here we study the model of cascading failures intro- 
duced by Motter and Lai [s^ , extending it to the case of 
distributed flows. As defined in [H^, the initially assigned 

processing capacities of nodes are Qi = {l-\-a)£f\ where 

if^ is the current vertex load on node i for unit current 
flowing between all source-target pairs as defined in Sec. 
[in for an undamaged network. Equivalently, each node i 
can handle an excess load aif^ where a > is called tol- 
erance (excess capacity) parameter. A cascading failure 
is then triggered by the removal of node i* with highest 
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load. The loss of node i* results in the redistribution of 
flows through the network that further leads to the re- 
distribution of loads in the network. By assuming that 
failure happens at all nodes whose loads are greater than 
their respective capacities, the (simultaneous) removal 
of all failed nodes results in the redistribution of loads 
among the remaining nodes, thereby resulting in more 
overloaded nodes and more failures. The cascade ends 
when none of the vertex loads exceed their respective ca- 
pacities. 

In the following we analyze cascades triggered by the 
removal of the highest load and ask for what values of 
conductance /3 weighting parameter and a excess capac- 
ity parameter is the network the most resilient to such 
cascades. Following [53], the resilience of a network is 
quantified in terms of the fraction of surviving largest 
connected component after the cascade: G = N' /N^ 
where N' is the number of nodes belonging to the largest 
network component after the cascade and N is the un- 
damaged (connected) network size. 

Network vulnerability to cascades can be increased by 
increasing the excess load capacity of nodes. However, 
this might not always be a feasible, cost-effective solu- 
tion. The question, therefore arises: is there an opti- 
mal conductance weighting scheme of the network that 
would limit the damage resulting from a cascading fail- 
ure for low values of a? As shown in Figs. [9l^a) and 
[nfb) the network resilience G against cascading failure 
for low values of a attains its maximum for hub-avoiding 
flows around j3 ~ —1, the same value for which the 
node-limited network throughput, shown in Sec. IIV A( 
becomes the highest. Hence, a necessary condition for 
scale-free networks (including the BA model) to achieve 
the highest resilience against cascading failures is hav- 
ing a balanced vertex load profile. This condition also 
assures that the node-limited network throughput is op- 
timal (see Sec. IIV A[) . In general, hub-avoiding flows 
result in relatively higher network resilience to cascades 
than hub-biased flows. 

As Fig. [9] already points out, the network resilience 
against cascading failures varies with the degree expo- 
nent of the network. This is shown in detail in Fig. [10] 
for two, relatively low values of a (0.1 and 0.15.) For 
hub avoiding flows [Figs. [TOTa). (b)], increasing 7 yields 
more robust structures; for unbiased and hub biased flows 
[Figs. [TOT c). (d)], increasing 7 yields less robust struc- 
tures. In the latter case, the cascade is triggered by the 
removal of a high degree node [Fig. ID^b)] which in turn 
results in the overload of many low degree nodes. The 
failure of low degree nodes results in a larger damage 
of a scale-free network with narrow degree distribution, 
i.e., 7 = 3.5 compared to a network with broader degree 
distribution, i.e., 7 = 2.5. In the case of hub-avoiding 
flows, the cascade is triggered by the removal of a low 
degree node [Fig. mb)] resulting in the overload of some 
of the large degree nodes. The removal of hub(s) yields a 
larger connected component in a scale-free network with 
narrow P{k) compared to a network with broader P{k). 




FIG. 9. Fraction of the largest surviving network component 
following a cascading failure {G) triggered by the removal of 
the highest vertex load as function of (5 conductance weighting 
parameter for (a) a = 0.10 and (b) a = 0.15 tolerance param- 
eters. Data were obtained for scale- free networks {N — 10^, 
(k) — 4.5) averaged over 60 network realizations. 



The monotonicity of function G(7) for all values of p in- 
dicates that network robustness is solely determined by 
network topology and link conductances for a fixed value 
of Oi^^^^ excess load capacity. 



VI. SUMMARY 

We have studied the problem of optimizing the 
throughput and robustness of networks carrying dis- 
tributed flows, from a design perspective. Specifically, 
we have used a formalism where the coupling between 
(local) network structure and the conductance of indi- 
vidual Hnks can be tuned to get the best results for a 
given objective. This approach has been inspired by the 
observation of correlations between link 'weights' and lo- 
cal topology in various man-made and natural systems 
m, [33 . Whether these systems are tuned for optimal 
throughput or robustness is an open question, although 
there have been some indications of bi olog ical systems 
operating in a near-optimal regime [56|, l57| . Analogous 
weighting schemes have been proven to be efficient for 
optimizing flow in RW jnl-tisi and shortest path routing 
[l, [35^ , and also for optimizing spreading [58]. 

The goal of this paper has been to systematically con- 
sider the optimization of throughput for distributed flows 
under various cases. We find that in uncorrelated scale- 
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free and ER networks, for identical flows between all 
sources and sinks and identical node capacities, each set 
to unity, hub- avoiding flows provide optimal throughput 
in the node-limited case, while weakly hub-biased flows 
do the same in the edge-limited case. The same trend 
is observed qualitatively for the Internet [46|, and also 
when the flows are heterogeneous, i.e., when current be- 
tween each source-target pair scales as the product of 
their degrees. Interestingly though, if the capacities of 
nodes are also distributed heterogeneously (still conserv- 
ing the total capacity), the optimal throughput occurs for 
a hub-biased flow. As mentioned earlier, this has possible 
implications for the case of airline transportation network 
where a similar hub-biased flow is empirically observed. 
Furthermore, flows on the links of metabolic networks 
have also been observed to be hub-biased, although a 
complete analogy to our distributed flow model there is 
unclear. Although similar trends favoring hub-avoiding 
flows and weakly hub-biased flows for node and edge- 
limited cases, respectively, are found for shortest path 
flows, for large systems, perhaps not surprisingly, dis- 
tributed flows are more eflicient at minimizing the max- 
imal load on the network, and thus providing greater 
throughput. Finally, we studied how the robustness of a 
network to cascading failures could be optimized within 
the parameters of our model. Our results indicate that 
for small amounts of excess capacity, networks with het- 
erogeneous degree distributions are maximally robust in 
the regime of hub- avoiding flows. 
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FIG. 10. Fraction of the largest surviving network component 
after a cascading failure {G) in scale- free networks (N — 10^, 
(/c) = 4.5) as function of 7 degree exponent, (a) and (b) 
correspond to the case of hub avoiding flows while (c) and (d) 
correspond to the case of unbiased and hub-biased flows. Data 
is shown for two values of the tolerance parameter: a = 0.1 
((a) and (c)) and a — 0.15 ((b) and (d)). Error bars are 
smaller than symbol size. 
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